Àá½Ã¸¸ ±â´Ù·Á ÁÖ¼¼¿ä. ·ÎµùÁßÀÔ´Ï´Ù.
KMID : 1022420150070040041
Phonetics and Speech Sciences
2015 Volume.7 No. 4 p.41 ~ p.47
A Study on Word Vector Models for Representing Korean Semantic Information
Yang He-Jung

Lee Young-In
Lee Hyun-Jung
Cho Sook-Wha
Koo Myoung-Wan
Abstract
This paper examines whether the Global Vector model is applicable to Korean data as a universal learning algorithm. The main purpose of this study is to compare the global vector model (GloVe) with the word2vec models such as a continuous bag-of-words (CBOW) model and a skip-gram (SG) model. For this purpose, we conducted an experiment by employing an evaluation corpus consisting of 70 target words and 819 pairs of Korean words for word similarities and analogies, respectively. Results of the word similarity task indicated that the Pearson correlation coefficients of 0.3133 as compared with the human judgement in GloVe, 0.2637 in CBOW and 0.2177 in SG. The word analogy task showed that the overall accuracy rate of 67% in semantic and syntactic relations was obtained in GloVe, 66% in CBOW and 57% in SG.
KEYWORD
GloVe, Korean corpus, semantic similarity, vector synthesis
FullTexts / Linksout information
Listed journal information
ÇмúÁøÈïÀç´Ü(KCI)